Distributed Compression of Graphical Data
نویسندگان
چکیده
In contrast to time series, graphical data is indexed by the vertices and edges of a graph. Modern applications such as internet, social networks, genomics proteomics generate data, often at large scale. The scale argues for need compress storage subsequent processing. Since this might have several components available in different locations, it also important study distributed compression data. paper, we derive rate region problem which counterpart Slepian–Wolf theorem. We characterize when statistical description can be modeled being one two types – member sequence marked sparse Erdős–Rényi ensembles or configuration model ensembles. Our results are terms generalization notion entropy introduced Bordenave Caputo local weak limits graphs. Furthermore, give result with more than sources.
منابع مشابه
Distributed Compression of Graphical Data
In contrast to time series, graphical data is data indexed by the nodes and edges of a graph. Modern applications such as the internet, social networks, genomics and proteomics generate graphical data, often at large scale. The large scale argues for the need to compress such data for storage and subsequent processing. Since this data might have several components available in different locatio...
متن کاملExploring Data Compression in Distributed File Systems
The ever-growing imbalance between computation and I/O is one of the fundamental challenges for current petascale and future exascale systems. As these large systems become more compute heavy by design, it is becoming harder to support dataintensive applications with adequate performance. To narrow the gap between computation and I/O, parallel and distributed file systems have been introduced. ...
متن کاملGraphical Model-Based Algorithms for Data Association in Distributed Sensing
Associating sensor measurements with target tracks is a fundamental and challenging problem in multi-target tracking. The problem is even more challenging in the context of sensor networks, since association is coupled across the network, yet centralized processing is in general infeasible due to power and bandwidth limitations. Hence efficient, distributed solutions are needed. We propose tech...
متن کاملWeighting of Double Exponential Distributed Data in Lossless Image Compression
In this paper we introduce a so-called probability assignment technique for double exponential data based on weighting. Results for lossless compression of image data show that this improves the compression performance over methods based on probability distribution estimation. The weighting technique gives a robustness that aaects the compression favorably .
متن کاملEntropy-based Consensus for Distributed Data Clustering
The increasingly larger scale of available data and the more restrictive concerns on their privacy are some of the challenging aspects of data mining today. In this paper, Entropy-based Consensus on Cluster Centers (EC3) is introduced for clustering in distributed systems with a consideration for confidentiality of data; i.e. it is the negotiations among local cluster centers that are used in t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Information Theory
سال: 2022
ISSN: ['0018-9448', '1557-9654']
DOI: https://doi.org/10.1109/tit.2021.3129189